Removing Bottlenecks in Distributed Filesystems: Coda & InterMezzo as examples
نویسندگان
چکیده
Is it possible for a distributed filesystem to perform at the same speed as local disk filesystems, at least in important cases? This paper is trying to answer this question for traditional client-server distributed filesystems, not for distributed filesystems exploiting very fast networks and disk striping techniques. We claim the answer is ”yes”. Systems such as AFS, Sprite, Coda, Arla and DFS showed what can be achieved by eliminating much RPC traffic, while NFS showed how aggressive kernel optimizations can help. Performance analysis of Coda and NFS shows that in order to achieve local disk performance on read traffic the kernel needs more autonomy. In Coda this leads to satisfactory results, both in micro benchmarks and in a http server benchmark. For read/write traffic performance is worse. Many operations lead to synchronous RPCs, but a new write-back caching model can permit a client to proceed without server interference and will eliminate most of the remaining RPC traffic. Coda can profit from this, but it does not solve the performance problems entirely. AFS and Coda maintain client caches with much the same functionality as a local disk filesystem, but these caches do not enjoy the superb performance and robustness of local filesystems like ext2. Secondly, working synchronously with a heavy weight cache manager costs Coda much in perfor-
منابع مشابه
The InterMezzo File System
Is it possible to implement a distributed file system, retaining some of the advanced protocol features of systems like Coda, while getting an implementation that is an order of magnitude simpler? We claim the answer is “yes”, and InterMezzo is well on its way towards being a prototype of such a system. The key design decisions were to exploit local file systems as server storage and as a clien...
متن کاملFile Systems for Clusters from a Protocol Perspective
The protocols used by distributed file systems vary widely. The aim of this talk is to give an overview of these protocols and discuss their applicability for a cluster environment. File systems like NFS have weak semantics, making tight sharing difficult. AFS, Coda and InterMezzo give a great deal of autonomy to cluster members, and involve a persistent file cache for each system. True cluster...
متن کاملPorting the Coda File System to Windows
We first describe how the Coda distributed filesystem was ported to Windows 95 and 98. Coda consists of user level cache managers and servers and kernel level code for filesystem support. Severe reentrancy difficulties in the Win32 environment on this platform were overcome by extending the DJGPP DOS C compiler package with kernel level support for sockets and more flexible memory management. W...
متن کاملDistributed filesystem forensics: XtreemFS as a case study
Distributed filesystems provide a cost-effective means of storing high-volume, velocity and variety information in cloud computing, big data and other contemporary systems. These technologies have the potential to be exploited for illegal purposes, which highlights the need for digital forensic investigations. However, there have been few papers published in the area of distributed filesystem f...
متن کاملLow Bandwidth Optimizations for Coda
An advanced distributed file system like Coda is an integral part of today’s mobile environment. This is due to the limitations of having large storage capacities on small mobile devices, finite battery life and the ubiquitous nature of wireless networking today. However, systems such as Coda do not perform optimally on low bandwidth connections such as low-speed error prone wireless links and ...
متن کامل